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Abstract. In this paper we are concerned with the conjecture that, for any set of generators S of the 
symmetric group Sym(n), the word length in terms of S of every permutation is bounded above by a polynomial 
of n. We prove this conjecture for sets of generators containing a permutation fixing at least 37% of the points. 



(Sj ■ 1. Introduction 

| For a group G and a set S of generators of G, we write T(G, S) for the Cayley graph of G with connection 
CN ■ set S, that is, the graph with vertex set G and with edge set {{g, sg} \ g E G, s E S}. The diameter diam(r) 
of a graph T is the maximum distance among the vertices of T and, in the case of a Cayley graph T(G, S), 
it is the maximum (over the group elements g E G) of the shortest expression g = s 1 ^ ■ ■ ■ s % ™ , with s/% E S 
^ and i]f E {—1, 1}. We define the diameter of a group G as 

diam(G) := max{ diam(r(G ! , S)) \ S generates G }. 

A first investigation of the diameter of Cayley graphs for general groups was undertaken by Erdos and 
Renyi [7j. Later Babai and Seress [I] obtained asymptotic estimates on diam(G) depending heavily on the 
group structure of G. In particular the results in [3] highlight the discrepancy between the diameter of 
Cayley graphs of groups close to being abelian and the diameter of Cayley graphs of non-abelian simple 



o 

C3 ' groups. Moreover, [3] contains the following conjecture of Babai. 
5 ' 

i h Conjecture 1.1 ([H Conjecture 1.7]). There exists c > such that, for all non-abelian simple groups G, 
diam(G) < (log|G|) c . 



> 

in 



in 
o 



The conjecture remains open, although significant progress has been made. In particular, starting with 
the work of Helfgott on the groups PSL(2,p) and PSL(3,p) [HIE] and based thereon, there has been a series 
of results O [HJ [20] proving the conjecture for finite simple groups of Lie type of bounded rank. The best 
statement known at the time of writing is by Pyber and Szabo [20] and says that there exists a polynomial 
c such that, for a finite simple group G of Lie type of Lie rank r, we have diam(G) < (log |G|) c ( r ). For the 
sake of comparison, Conjecture 11.11 asserts that c should be a constant rather than a polynomial. 

The proofs of these theorems make use of new results in additive combinatorics, specifically on growth 
in simple groups. We note that the difficulties in generalizing these results from groups of bounded rank 
to those of unbounded rank seem closely related to difficulties in proving Conjecture [T7TJ for the alternating 
^ , groups Alt(n). In both cases (that is, classical groups of unbounded rank and alternating groups) there 
are known counterexamples to general "growth results" for sets (see for example j!9l [2D. 23J ) , which were 
central to the approach used to prove Conjecture 11.11 for groups of Lie type of bounded rank. What is more, 
these two classes of counterexample are, in some sense, related. 

In this paper we focus on the case where G = Alt(n) or Sym(n). Let Q be a set of size n. For g E Sym(fJ), 
define the support of g by supp(<?) = {7 E £1 \ ^ 7}. Observe that supp(g) is equal to the complement in 
f2 of the fixed set, fix(g), of g. Babai, Beals and Seress [2] proved the following result. 

Theorem 1.2 ([2]). For every e < 1/3 there exists c £ > such that, if G = Sym(n) or Alt(n) and S is a 
set of generators of G containing an element g with |supp(g)| < en, then 

diam(r(G,5)) < c £ n 8 . 

In this paper we provide a variant of the argument in [2] to prove the following stronger theorem. 
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Theorem 1.3. Let C = 0.63. There exists c > such that, if G = Sym(n) or Alt(n) and S is a set of 

generators of G containing an element g with |supp(g)| < Cn, then 

diam(r(G, S)) < 0(n c ). 

We do not try to minimize the exponent c in the theorem. Our arguments give c < 78, but with some 
more work the bound on diam(T(G, S)) can be improved to at least 0(n 66 ). 

Theorem 11.31 also extends to directed graphs. Given G = (S), the directed Cayley graph T(G,S) is the 
graph with vertex set G and edge set {(g,sg) : g G G,s £ S}. Analogously to the undirected case, the 
diameter of r(G, S) is defined as the maximum (taken over g £ G) of the shortest expression g = s\ - ■ ■ s m , 
with each s k £ S. By a theorem of Babai JJ Corollary 2.3], diam(f(G, S)) = O (diam(T(G, 5)) • (log |G|) 2 ) 
holds for all groups G and sets S of generators, so we immediately obtain the following corollary. 

Corollary 1.4. Let C = 0.63. There exists d > such that, if G = Sym(n) or Alt(n) and S is a set of 

generators of G containing an element g with |supp(<?)| < Cn, then 

diam(f (G, S)) < 0(n d ). 

We note that for arbitrary sets of generators the best known bound is quasipolynomial, by a recent result 
of Helfgott and Seress: 

Theorem 1.5 ([13J). For G = Alt(n) and Sym(n), diam(G) = exp(0((logn) 4 loglogn)). 

The machinery developed in this paper turns out to have application to other questions within permutation 
group theory. Indeed it is possible to use variants of the results given in Section [3]to recover, and strengthen, 
classical results concerning multiply transitive groups due to Manning [TBJ E3 US] an d Wielandt [25] . This 
will be the subject of a forthcoming paper [TO] . 

1.1. The main ideas. It is well-known and easy to see that if a set A of generators of G = Alt(n) or 
Sym(n) contains a 3-cycle t then every element of G can be written as a word of length less than n 4 in A. 
Indeed, repeatedly conjugating t by A gives all 3-cycles as words of length less than n 3 , and each element 
of Alt(n) is a product of at most [_ n /2j 3-cycles. Finally, if G = Sym(n) then one more multiplication by A 
gives words for all elements of G. Thus, given any set S of generators of G, in order to prove Conjecture II. 1[ 
it is enough to construct a 3-cycle as a word in S of polynomial length. 

We may try to reach a 3-cycle in stages, by constructing elements of smaller and smaller support. Up 
to very recently, the only subexponential method to obtain an element of support less than cn, for some 
constant c < 1, from arbitrary generating sets was in [3J. In that paper, iteration of the support reduction 
was utilized to prove diam(G) = exp((l + o{\))\Jn log n), the only subexponential bound until Theorem 1 1.51 

Theorem 1 1 . 2 1 may be interpreted as a reduction of Conjecture [TTTJ for alternating groups, to the problem of 
constructing an element g of moderately small support as a short word in an arbitrary set S of generators. 
The proof of Theorem 11.21 in [2J is based on the following observations. 

(BBS1) If |supp(a)| < en for some e < 1/3, a 6 G, G = Alt(n) or Sym(n), and r is a random element 
of G then, for b = a r , the commutator [a, b] = a~ l b~ l ab has support smaller than a with positive 
probability. 

(BBS2) In (BBS1), it is not necessary that r is a uniformly distributed random element of G. It is enough 
that, for some constant I, r maps a sequence of distinct elements of length I from the permutation 
domain nearly uniformly to all other sequences. Furthermore, random words r on any set S of 
generators of G, of length n°^\ satisfy this property. 
In [2], the number I = 3 was chosen and a 3-cycle was constructed in O(loglogn) applications of (BBS1). 
A major conceptual novelty of [2] is that besides the natural action of G on n points and the action of 
G on itself as in T(G,S), it is beneficial to work with other actions of G. This principle is more clearly 
formulated in [13| . In [2] and in the present paper, the action of G on sequences of length t from the natural 
permutation domain is used, while in [TSJ other actions are utilized as well. 
Chronologically, we made three improvements to the argument in [2]. 
(NEW1) The conclusion of (BBS1) holds for e < 1/2, implying a version of Theorem 11.31 with C < 0.5. 
(NEW2) With positive probability, the commutator [a, b] has many fixed points, and also contains a significant 
number of 3-cycles. Thus, if |supp(a)| < en for some e < 0.585, then [a, b} 3 has support smaller than 
a. This implies Theorem 11.31 with C = 0.585. 
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(NEW3) With positive probability, the permutation [a, b x ][a, b] has many fixed points and 2-, 3-, 4- and 
5-cycles. So, for e < 0.63, ([a, 6 _1 ][a, 6]) 60 has support smaller than a. 
In this paper we only prove the strongest version based on (NEW3). We have to overcome several technical 
difficulties: (i) the analysis of the local behaviour (i.e., finding how o and b should interact on small subsets 
A of the natural domain such that [a, & _1 ][a, b] forms a short cycle on some points of A); (ii) ensuring 
that ([a, ft -1 ] [a, 6]) 60 is not the identity of G; and (iii) handling the special case when the originally given 
generator a has order 2 x 3 y for some x, y > 0. We shall apply the argument of (BBS2) with I = 26. 

The structure of this paper is as follows. In Section [21 we collect basic concepts regarding groups and 
graphs, and introduce the central notion of a(3-trees. These are the objects describing the possible local 
interactions of a and b. We also introduce our probabilistic method. In Section [3] we give a graph-theoretic 
technique for estimating the number of fixed points of w(a, b), where w(a, f3) is a reduced word in a and (3, 
and a and b are particular conjugate permutations of Sym(f2). In SectionSl we apply the results of Section[3] 
to the word [a, f3~ 1 ] [a, (3] = a -1 (3af3~ l a~ 1 f3~ l a/3 and we prove Theorem 11.31 In Section [5] we discuss some 
possible extensions of Theorem 11.31 
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phase of this research. In addition expenses for N.G. and H.H. to visit the University of Western Australia 
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2. Basic concepts 

In this section we collect definitions and basic results that will be needed in the proof of Theorem 11.31 

2.1. Permutation groups. Let f2 = {1,2, ... ,n}. We use Sym(n) and Sym(J7) interchangeably; more 
exactly, we use Sym(O) when we emphasise the action of Sym(ra) on $7. Let S C Sym(n) and k, I G Z + . 
Define 

S e = {si ■ ■ ■ S£ | s\, . . . , S£ G S}; S^ 1 = {s _1 | s G S}. 
For ugl! and a,g£ Sym(n), we write uj 9 for the image of ui under g; and a 9 for g~ 1 ag. We denote by O,^ 
the set of fc-tuples of distinct elements of S7 and we write n^) = \Q(k) I = n ( n — 1) • • • (n — A; + 1). 
We shall use the following result of J. Whiston [24] . 

Lemma 2.1 ([24]). Any set S of generators for G = Sym(n) or Alt(n) contains a subset A of cardinality 
less than or equal to n — 1 that also generates G. 

If A and B are two sets of generators for G with A C B then clearly diam(r(G, B)) < diam(r(G, A)). 
Therefore, Lemma [2.11 implies that it is enough to prove Theorem 1 1.31 for sets S of generators of size at most 
n which contain an element g of small support. 

2.2. Graphs. In this paper a graph X is a finite connected directed graph. Moreover, we allow loops on the 
vertices of X and multiple edges. We do not assume that X is strongly connected, i.e., it is possible that 
there is no directed path between some vertices x and y of X. We write V(X) for the set of vertices of X 
and E(X) for the set of edges of X. An edge e running from vertex i to vertex j will be written but 
we warn that this notation is ambiguous as there may be more than one such edge. 

We define an a/3-graph, T say, to be a graph together with a label, a or (3, attached to every edge. We 
require that for each vertex v G V(T) and for each 7 G {a,/3}, v is incident with at least one edge labelled 
by 7. El We also require that at most one edge starting at v is labelled by 7 and at most one edge ending 
at v is labelled by 7. Here a loop at v counts as one incoming and one outgoing edge for v. Notice that all 
vertices of an a/3-graph have in-degree at most 2 and out-degree at most 2. For 7 G {a, (3}, we define T 7 to 
be the subgraph of T with vertex set V(T) and edge set the set of edges of T labelled by 7. 

^We impose this condition because it is necessary in some of the arguments used in Section It is not a priori necessary 
to our strategy for analysing the fixed points of words; this condition limits our implementation to those reduced words w on 
{a, ft, a -1 , ft -1 } for which all exponents are equal to ±1. 
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We say that a cycle C of T is monochromatic if all of its edges are labelled a (resp. /3), that is, C is 
a sequence of vertices {y\, . . . ,v r ,v r+ i) with v r+ \ = v\ and r > 2, and where for each i G {1, . . . ,r} the 
ordered pair (vi,Vi + i) is an edge of T labelled a (resp. f3). 

Given permutations a, b G Sym(fi) and an injective map t : V(T) — > fi, we say that T Q is hosted by (t, a) 
if (xi) a = yi for each edge or loop (x,y) G E(T a ), Similarly, is hosted by (t,b) if = ?/i for each edge 
or loop G E{Tp). Finally, T is hosted by (i,,a,b) if T Q is hosted by (i, a) and is hosted by (i, 6). 

Let T be an a/3-graph. Observe that for 7 G {a,/3}, the connected components of T 7 are of three types: 

(1) j-loops: isolated vertices, namely the vertices v of T having a loop at v labelled with 7; 

(2) j- cycles: monochromatic cycles all of whose edges are labelled 7; 

(3) j-paths: maximal directed paths such that all edges are labelled with 7. 

We denote by £ 7 (T) the number of 7- loops and by p 7 (T) the number of 7-paths and 7-cycles. 

We say that an a/3-graph is an af3-tree if all undirected cycles are monochromatic (and so necessarily 
they are also directed cycles). Note that an a/3-tree may not be a tree in the usual graph-theoretic sense. 
The following result will be crucial. 

Lemma 2.2. Let T be an a(3-graph. Then p a (T) +pp(T) + l a {T) + lp(T) < \V(T)\ + 1. Moreover, equality 
holds if and only if T is an a /3 -tree. 

Proof. Let B be the graph with vertex set the set of a-loops, a-paths, a-cycles, /3-loops, /3-paths and /3-cycles 
of T. We declare two distinct vertices x and y of B adjacent if there exists a vertex v of T such that v is 
incident with both x and y. By construction, B is bipartite (with the a-objects comprising one class of the 
bipartition and the /3-objects the other) and |V(i?)| = p a (T) + pp{T) + l a (T) + lp{T). By the definition of 
a/3-graphs, each vertex v of T is incident with exactly one component of T a and with exactly one component 
of Tp. Hence v defines exactly one edge in B and so \E(B)\ = \V(T)\. Since T is connected, the graph B is 
connected and so |V(.B)| < |22(.B)| + 1 = |V(T)| + 1, proving the first claim. 

Observe that if T contains a non-monochromatic cycle, then B contains a cycle. Conversely if B contains 
a cycle, then T must contain a non-monochromatic cycle. We conclude that B is a tree if and only if T has 
no non-monochromatic cycles, and the second claim follows from the standard fact that B is a tree if and 
only if \V(B)\ = \E(B)\ + 1. □ 



We end this subsection with some more definitions. For an a/3-graph T and for < 5 < 1, we define 
5t := (1 — 5)' Q ( T ) + W T )<5 Pa ( T ) +p < 9 ( T ). An isomorphism between a/3-graphs T\ and T2 is defined to be a 
bijection cp : V{T\) — > V{T%) that preserves edges and edge-labels. If, for some x,y G V(T\), there are two 
directed edges from x to y in T\ (with necessarily different labels) then in T2 there are also two directed 
edges from xf> to yip. We define an automorphism of T to be an isomorphism between T and itself. The set 
of all such permutations is the automorphism group Aut(T). 



2.3. Words and graphs. Let T be an a/3-graph and w = w\W2 ■ ■ ■ Wk be a reduced word with w%, . . . , G 
{a, a -1 , /3, /3 -1 }. We say that T admits w, if there exists a vertex x of T such that by starting at x and by 
tracing the edges and the loops of T with labels (w±, 11)2, ■ ■ ■ , Wk), we visit all vertices and edges of T and we 
return to the vertex x (here by abuse of notation, we interpret the label a" 1 as the label a with the edge 
pointing in the opposite direction, and a similar convention holds for (3). The vertex x G V(T) is called a 
fixed vertex for (T, w); note that there may be more than one such vertex in T. 

The following Table [1] contains pairwise non-isomorphic a(3-trees that admit the word w = [a, [a, 0\. 
Here dashed (red) lines are labelled with a, solid (blue) lines are labelled with /3, and for simplicity of 
drawing all loops are omitted; thus, any vertex which is not incident to a red line (or blue line, respectively) 
is in fact incident to an a-loop (or /3-loop, respectively). The fixed vertices are written as red stars, and 
under each graph T we have written the value for 5t- 
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Table 1. a/3-trees admitting w = [a, /3 _1 ][a, 0\ = a~ 1 fiafi~ x a~ x I3~ 1 a(3. 
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Next, we explain how a/3-trees can be used to estimate the number of fixed points in certain permutations. 
Let T be an a/3-tree admitting the reduced word w = w±W2 ■ ■ -Wk, and let x be a fixed vertex of (T,w). 
Starting at x and tracing w we obtain a sequence U = (x = xo, x%, . . . , Xk = x) of vertices of T such that, 
by definition, all v G V(T) occur in U. 

Let a,b G Sym(f2). If T is hosted by (t, a, 6) for some injective map t : V(T) — > then, starting at 
xz. and tracing the word w by using a and b for the labels q and /3, respectively, we obtain the sequence 
Ul = (xqL, x\i, . . . , Xki). In particular, xi and w uniquely determine the entire map t and xi is a fixed point 
of the permutation w(a,b). 

For fixed a, 6, and to as above, let T\,...,T m be pairwise non-isomorphic a/3-trees admitting u;. For 
1 < j < m, we denote the number of fixed vertices in Tj (with respect to w) by f ixed (Tj ) . Also, let Ij be an 
index set such that for z E Ij, there exists i z : V(Tj) — > Q, with Tj hosted by (i z ,a,b). 



Lemma 2.3. Let a,b,w,m,Tj,Ij be as in the previous paragraph. 

(i) If x is a fixed vertex of (Tj 1 ,w), y is a fixed vertex of(Tj 2 ,w), and xl Zi 

Z2 € Ij 2 then j\ = j2 and x,y are in the same orbit of Aut(7j 1 ). 
(ii) The number of fixed points ofw(a,b) is at least 



yi Z2 for some Z\ G Ij 1 and 



te(w(a,b))\ >J2 



\Ij\- fixed (Tj) 
|Aut(T.-)| ' 



Tj 2 l Z2 , the map i zx t is a label-preserving isomor- 
j2 and l z ±Lz 2 G Aut(Tj 1 ). Moreover, since xl z1 l~ 2 = y, x and y 



Proof, (i) Since xt 21 = yi 22 and w determines Tj 1 i zi 
phism between Tj 1 and Tj 2 . Therefore, j% 
are in the same orbit of Aut(T J1 ). 

(ii) For 1 < j < m, let Fj be the set of fixed points of w(a,b) of the form xl z , for some fixed vertex 
x G V(Tj) and z G Ij. By (i), the sets Fj are pairwise disjoint. Moreover, for z G Ij, i z contributes fixed (Tj) 
fixed points to Fj, giving a total count of \Ij\ ■ fixed (Tj). Part (i) also implies that any element of Fj occurs 
in this count at most Aut (Tj) times. □ 



2.4. Walks on graphs. In this subsection we consider graphs X that are symmetric and regular in the 
following sense. Symmetric means that for any two vertices x, y G V{X), the number of edges from x to y is 
the same as the number of edges from y to x. Regular of valency d means that each x G V(X) has in-degree 
and out-degree d. For x G V{X), we denote by A(x) the d-element multiset {y \ (x,y) G E(X)}. 

Definition 2.4. A lazy random walk on A is a discrete stochastic process where a particle moves from 
vertex to vertex in X. If, after k steps, the particle is at x G V(X) and A(x) = {y\, . . . ,yj} then at step 
k + 1 the particle 

(i) stays at x with probability 1/2; 

(ii) moves to vertex with probability l/(2d), for all i = 1, . . . , d. 

The asymptotic rate of convergence for the probability distribution of a particle in a lazy random walk 
on X is an important and well-studied problem in combinatorics and computer science (see |15j). For 
x,y G V(X), we write Pk{x,y) for the probability that the particle is at vertex y after k steps of a lazy 
random walk starting at x. For a fixed e > 0, the mixing time for e is the minimum value of k such that 

' -(l-e)< Pk (x,y)<—^—{l + e) 



\V{X)\ 



\V{X)\ 
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for all x,y E V{X). The following estimate is well-known; for a proof, see e.g. |13t Section 4]. 

Lemma 2.5. Let X be a connected, symmetric, and regular directed graph of valency d and with N vertices, 
and let e > 0. Then the mixing time for e is at most N 2 d\og(N / e) . 

For G = Sym(S7) or Alt(f2) and G = {S), we are interested in the following symmetric and regular directed 
graphs Xk, for positive integers k. Let V{Xk) := Sl(k), and E(Xk) := {(x,x 9 ) \ x E Q^) and g E S U S^ 1 }. 
Clearly X^ has no.) vertices, is connected, is symmetric, and is regular of valency \S U S \. 

It is useful to induce random walks on the graphs X^ for different k at the same time. This is done as 
follows. First, we choose a subset J C {1, 2, . . . , £}, where J is the set of steps when the particle moves to 
a neighbour of the current position as in Definition 12. 4f ii). The length j := \J\ is chosen from the binomial 
distribution B(£, 1/2) and then J itself is chosen from the uniform distribution on the j-element subsets of 
{1, 2, . . . , £}. Finally, for i E J, we choose gi E SU 5 _1 uniformly and use gi in the i th step to define the edge 
on which the particle moves. The overall effect, that is, the trajectory of a lazy random walk with initial 
position x E V(Xj t ) : is the same as computing the image of x under the permutation r = Y\ ieJ gf, we say 
that the permutation r is realised by the lazy random walk. The construction of r uses only the number t 
and S U S , so the permutation r can be considered as realised by lazy random walks in more than one 
graph Xk. Of course, these lazy random walks are not independent. 

Lemma 12.51 will be useful to us in the following form. 

Lemma 2.6. Let S be a set of generators of Sym(f2) or Alt(f2) of cardinality at most n, and let k be a 
positive integer. Fix < e < 1 and set £ > 2n 2fc+1 log(n fc /e). Then, if r E Sym(f2) is realised by a lazy 
random walk of length i on Xk, and x,y E fi(fc), then 

(1 - e)— < P{x r = y) < (1 + e) — . 

n {k) n (k) 

Proof. Recall that n^) = |^(^fc)| by definition, and nm < n k . Furthermore \S U < 2n and the proof 
follows from Lemma 12.51 □ 

Note that if r E Sym(J7) is realised by a lazy random walk of length I on X^, then r E (S U S^ 1 U {1})^- 

3. Primary machinery 

The results of this section provide the primary machinery for a proof of Theorem 11.31 The main step 
of the proof is that given a set S of generators for G = Sym(n) or Alt(n) and a E S of support size 
|supp(a)| = <5n, we would like to construct a permutation as a short word in S with support size less than 
5n. The permutations we consider are of the form w(a,a r ), for an appropriately chosen reduced word w in 
the symbols {a, a -1 , /3, In this section, we assume that w is given, and describe how to choose r E G 

such that w(a, a r ) has many fixed points. We obtain r as a permutation realised by a lazy random walk. 

Let w = w\W2 ■ ■ ■ Wk and let T = {T\, . . . ,T m } be a set of pairwise non-isomorphic a/3-trees admitting 
w. By Lemma l2.3l fii). we would like to choose r so that each T E T is hosted by (i,a, a r ) for many maps 
l : V{T) — )■ O. As a is fixed, it is beneficial first to examine embeddings of T a and Tp separately. 

We prove results for two kinds of permutations a. In the "generic" case, all non-trivial cycles of a are 
long, compared to the a- and /3-paths occurring in trees T and in the "special" case supp(a) consists of 
short cycles of equal length. We fix a small set A C $7 (in the application in Section HI |A| < 10) and require 
that r fixes A setwise and acts on A on some prescribed way. The purpose of prescribing the action of r on 
a small set is to ensure that w(a,a r ) is not trivial. (This trick has already been used in [2].) As the points 
in A play a special role, we are only interested in injections t : V(T) — > f2\A. First, we handle the "generic" 
case. 

Lemma 3.1. Let < <5o < 1/2 and let k,X,N be positive integers. Suppose that a E Sym($7) has no cycles 
of length less than N and that |supp(a)| = 5n, for some 5 E (5q, 1 — 5q). Let 7 E {a,/3} and let T be an 
aft-tree such that \V{T)\ < k, T has no j-cycles, and every j-path in T has at most N vertices. Let A C Q, 
|A| < A, and let 

S y (T) := {l : V(T) -)■ fi\ A | T 7 is hosted by (l, a)}. 

Then 

\S 7 (T)\ > C(So,K,X,N,n) (1 - ^ {T) <5^ (T) n l ^+^ T \ 
where C(Sq, k, A, N, n) is a function with lim n ^ 00 C(5o, k, X, N,n) = 1. 
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By lim^^oo C(5q, k, X, N, n), we mean that the variables So, k, A and N are fixed, and n goes to oo. 

Proof. Let v\, . . . , Wj m be the loops of T 7 and let Pi, ... , P Pj (t) be the directed paths of T 7 . We embed 
the components of T 7 into O \ A one- by-one, and estimate |<S 7 (T)| by the product of the number of possible 
embeddings at each step. 

The vertices Vi, for 1 < i < / 7 (T), have to be mapped to fixed points of a. As a has at least n — 5n — X fixed 
points outside A, v\u, . . . , »j mi can be chosen in at least (n— 5n— A)(n— 5n— A— 1) • • • (re— 5n— A— Z 7 (T) + 1) > 
(n — 5n — X — k) 1 ' 1 ^ distinct ways. 

Next, we consider the directed paths in T 7 . For 1 < i < p 7 (T), we write Pi = (wi t o, . . . , Wi )Ci ). Now, 
once the image of Wi t o under l is chosen, in order to guarantee that i hosts the path Pi, we require that 
Wijt = (wifit) aJ for each j = 0, . . . , Cj. Hence, for each i £ {1, . . . ,p 7 (T)}, the image of Pi under i is uniquely 
determined by Wi$i. Let Aj be the union of the sets P z i, for z < i. Since by hypothesis a has no cycles 
of length N — 1 or shorter and since T has no 7-path of length greater than N, the only requirement for 
choosing the image of u^o under 1 is that 

w it0 L? (J(A i UA)°" j 
i=o 

(since we have to avoid A and the t-images of previously mapped P z ). A gross overestimate for the size 
of this union is (c, + 1)(A + |y(T)|) < k\ + k 2 , and so Wifli can be chosen in at least 5n — kX — k? ways. 
Summarizing, we obtain 

|5 7 (T)| > (n — Sn-X - k) 1 ^ (Sn - kX - k 2 ) p ^ t \ 
By factoring out 5, (1 — 5), and n, we get 

\SJT)\ > C(5 ,K,X,N,n) (1 - 8) 1 ^ 5^ n L '^ + ^ T \ 



C(S , K ,X,N,n)= kA + k2 



where 

r^A, k \ N = I 1 — - 

Clearly, limn^oo C(6 ,K,X,N,n) = 1. □ 

The case of "special" permutations a is very similar. 

Lemma 3.2. Let < 5q < 1/2 and Zei k,X,N be positive integers. Suppose that every cycle of a £ Sym(f2) 
has length 1 or N and |supp(a)| = Sn, for some 5 S (<5o>l — o~o). Let 7 S {a, /3} and /et T fee an ctfi-tree 
such that \ V(T)\ < k, every ^-cycle in T has N vertices, and every ^-path in T has at most N vertices. Let 
ACU, |A| < A, and let 

5 7 (T) := {t : V(T) U \ A I T 7 is aosted 6w (i, a)}. 

\S y (T)\ > C(5 ,K,X,N,n) (1 - 5) 1 ^ 5^ n l ^ + ^ T \ 
where C(5q, k, X, N, re) is a function with linin^oo C(5q, k, X, N, re) = 1. 

Proof. We may follow almost verbatim the proof of Lemma 13.11 The only difference is that the list 
Pi, ... , Pp 7 (T) ma y a l so contain 7-cycles, so we have to change slightly the definition of the vertices Wi.Q. If 
Pi is a 7-path then Wi$ is the starting vertex of the path as before, while if P, is a 7-cycle then Wi$ can be 
chosen as an arbitrary vertex of Pj. Since the 7-cycles of T 7 have the same length as the cycles in supp(a), 
the rest of the proof goes through without any modification. □ 

Now we are ready to prove the main result of this section. Let < 5q < 1/2 and k,X,N £ Z + be fixed, 
and let w = w\ ■ ■ ■ be a reduced word in {a, a -1 , f3, Suppose further that T = {Ti, . . . , T m } is a 

set of a/3-trees admitting w and |^(P)| < k for all T S T. Let G = Sym(n) or Alt(n) be generated by a 
set S of cardinality at most re. We do not have to distinguish the two cases ("generic" and "special") for a 
anymore, so let a £ Sym(O) with |supp(a)| = Sn for some 5 £ (<5o, 1 — <^o) an d suppose that either 

• all non-trivial cycles in a have length at least N, none of the a/3-trees T £ T have any cycles, and 
every a- and /3-path in T has at most vertices; or 

• every cycle of a has length 1 or N, for all T £ T every a- and /3-cycle has N vertices, and every a- 
and /3-path has at most N vertices. 
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Let A C f2, |A| < A, let g E Sym(A), and let Sy(T) be as in Lemmas 13. II and 13.21 Finally, let an error bound 
e > be given. We "collect" errors in estimates from different sources, so we choose e' < e such that 

1 + e' 

Moreover, we may assume that n is larger than a bound no(5o, k, A, N, e) depending only on 5q, k, A, N, and 
e such that: 

(i) For all 7 E {a, /3} and for all T E 7~, 

(3.2.1) |S 7 (T)| > (1 - e') (1 - 8) 1 ^ 5^ n h(T)+P^(T)_ 

(Note that this inequality is satisfied by large enough n, by Lemmas 13.11 and 13.21 ) 

(ii) n 2 ( K+A+1 ) > 2n 2 ( K+A )+ 1 log(n K+A /e / )- 

Recall that S T = (1 - J)'«(T)+W T ) >5P*( t )+pp( t ) an d that fixed(T) denotes the number of fixed vertices of 
(T,w). 

Theorem 3.3. With the notation of the previous paragraph, there exists r E Sym(f2) realised by a lazy 
random walk of length n 2 ( K + A+1 ) such that r\\ = g and 



™ 5 T • fixed (T 

fixKfl,fl r ))l>(l-g)» E ] A ut(7T ' 



.1 > 



r- 

By t|a we mean the restriction of the permutation r (considered as a function r : f2 — > $7) to A. 

Proof. Let r be realised by a lazy random walk of length £ := ra 2 ( K + A +!) . Our main goal is to give an estimate 
for the conditional expectation E(| fix(w(a,a r ))\ \ r\\ = g). 

Let T E T and l E S a (T) be arbitrary but fixed. First, we give an estimate for the probability that T is 
hosted by (i,a,a r ). By Lemma I2TE1 for any x,y E (SI \ A)nyr T \\\, 

Prob(x r = y A t|a = 5) > (1 — e') and (1 + e') — - — > Prob(r|A = g)- 

n(\A\+\V(T)\) "(|A|) 

Hence, for the conditional probability Prob(x r = y \ r\\ = g), we have 

, , , , , 1 - e' 1 1 - e' 1 

(3-3.1) Prob(x r = y\ r\ A = g)> - — > 



l + e /( n _|A|) (|y(T)|) i + £ ' n \v(T)\- 

Since by definition T a is hosted by (l, a), T is hosted by (l, a, a r ) if and only if Tp is hosted by (l, a r ); this 
is equivalent to 

(xL) aT = yi for all (x,y) E E(Tp) 

^^y-'ar = yi for aU g £( T/? ) 

^(xO r " a = for all (x,y) E E(T fi ) 

the function V(T) -> fi,s H- (xi) r is in Sp(T) 
^(V{T)L z ) r = V(T)l for some t z E 5^(T). 

Lemma [273] implies that for different i Zl , l Z2 E Sp(T), the events (F(T)t 2 J r = V(T)t are disjoint. Therefore, 
also using (|3.2.ip and (|3.3.ip . 

Prob(T is hosted by (4, a, a r ) | r| A = g) = Prob ((V(T)l z ) v = V{T)l \ r\ A = g) 

(I _ e ')2 (1 _ X)lf}(T) xpp (T) n lp(T)+pp(T) 

< 3 - 3 - 2 ' > \ + J am • 
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Next, by Lemma E3pi) and (|3XH . E2l) . 

E(|fix(^(a,a r ))| 



m 



r|.\ = 5) > ^ ^ Prob(T 3 - is hosted by (1, a, a r ) r| A = g) f^7^ 

(1 _ £ 'f (1 _ ^M^O+WT,) (JPafTfJ+p/jfTj) ^(T^+l^T^+p^+p^Tj) fj xed ( T , 



> 



E l 1 - fc J 
1 + e' 



nl y ^)l I Aut (7})|' 



Finally by LemmaS i a (Ty) + lp(Tj) + p a {Tj ) + pp (Tj ) = 1^)1 + 1, yielding 

E(| fixMa, o r ))| I r| A = g) > (1 - e) n ^^rjf ' 

To finish the proof of the theorem, we simply take r that gives at least the expected number of fixed 
points. □ 

4. Proof of Theorem 11.31 

In this section we apply Theorem 13.31 to prove Theorem 11.31 We start with two technical lemmas. 

Lemma 4.1. Let m be an integer and take g,h E Sym(m). In each of the following cases, [h, (h 9 )~ l ][h, h 9 ] 
contains a 7-cycle: 

(1) m > 7, h = (1, 2, 3, ... , m), g contains the cycle (1, 3, m) and fixes the points 2, 4, 5, 6, m — 3, m — 
2,m- 1; 

(2) m = 7, / l = (l,2,3,4,5)(6)(7), g = (1, 6)(3, 7)(2)(4)(5); 

(3) m = 7, h= (1,2,3)(4,5,6)(7), g = (1, 7, 2, 4)(3)(5)(6); 

(4) m = 7, /i = (1,2)(3,4)(5,6)(7), 5 = (1, 5, 7, 2, 3)(4)(6). 

Proof. To show (1), we note simply that (l,m,5,3,m— 1,4,2) is a 7-cycle of [h, (h 9 )~ l ]\h, h 9 ]. Parts (2), (3) 
and (4) follow easily. □ 

Note that in each case in Lemma 14.11 we prescribe the action of g on at most 10 points. 
For < 5 < 1, define 

f(5) :=(1 - 5f + (1 - 8f5 + (1 - 5f8 + 4(1 - 5) 4 5 2 + 2(1 - 5) 2 5 3 

+ 3(1 - 5) 5 5 3 + 10(1 - 5) 9 5 4 + 26(1 - 5) 7 5 5 + 20(1 - 5) s 5 5 

(4.1.1) + 6(1 - 5) 5 J 6 + 16(1 - 5f5 6 + 40(1 - 5) 8 5 6 + 3(1 - 5) 4 <5 7 

+ 8(1 - 5f5 7 + 20(1 - 5) 7 ^ 7 + 10(1 - 5) s 5 9 + 20(1 - <5) 7 £ 10 

+ 10(1 -5f5 u + 15(1 -5) 7 <5 n . 

Lemma 4.2. 

(1) T/ie function 5 1— > 1 — 0.999/(5) is monotone increasing on the interval (0, 1). 

(2) T/ie equation 0.999/(5) = 1 — 5 has a unique solution in (0,1). Up to six significant digits, the 
solution is 5 = 0.632599. 

(3) Starting with 5 = 0.63, nine iterations of the function 5 1— > 1 — 0.999/(5) reach a value less than 
0.326. 



Proof. All three results can be established using an algebra package such as Sage jS + 09j . □ 
We are now ready to prove Theorem II .31 



Proof of Theorem \1.3[ Let a be an element of S with |supp(a)| < Cn = 0.63n. We shall apply the results 
of Section [3] for the word w = Wq°, where u>o = wo(a,P) = [a, [a, /?]. 

The proof splits into a number of cases, according to the order of a. Let \a\ = 2 ei 3 62 e3, with e% coprime to 
6. Note that 2 ei < n and 3 62 < n. Suppose first that e% > 1. Then a 2 13 2 is non-trivial, and we may replace 
a with a 2 " 13 " 2 and assume that the order of a is coprime to both 2 and 3 (note that a 2 " 13 " 2 G (SuS~ 1 Ll{l}) n ). 

In Tables [U [21 [3l HI and[5l we present a family T of a/3-trees admitting Wq°. These a/3-trees were obtained 
with the help of a computer; note that our conventions for representing them are outlined in Subsection 12.31 
Note too that if an a/3-graph admits Wq then it also admits wff for any positive integer k. The a/3-trees 



10 



J. BAMBERG, N. GILL, T. P. HAYES, H. A. HELFGOTT, A. SERESS, AND P. SPIGA 



in these tables are pairwise non-isomorphic, do not have non-identity automorphisms, contain at most 16 
vertices, and have all a-paths and /3-paths of length at most 4. For this family, 

5 T ■ fixed (T) 



E 



,_ T Au.COl 



where f(5) is as in ()4.1.1|) . If a has a cycle of length m > 7 then relabel the letters of O so that this cycle is 
equal to (1, ... , to,), define A = {1, 2, 3, 4, 5, 6, to — 3, m — 2, m — 1, m} and let <? = (1, 3, to) (2) (4) (5) (6) (m — 
3)(m — 2) (to — 1) G Sym(A). If this is not the case, then we must have = 5; in this case relabel the letters 
of £1 so that a contains the cycle (1, 2, 3, 4, 5) and a fixes both 6 and 7. Then define A = {1, . . . , 7} and let 
= (1,6)(3,7)(2)(4)(5). 

By Lemma [24] we may suppose that \S\ < n. We apply Theorem 13.31 with 5q = 0.3, w = Wq°, T, A" = 4, 
A = 10, g, k = 16, and e = 0.001. We obtain r 6 (5U S^ 1 U {l})™ 54 such that | Sx(w(a, a r ))| > 0.999/(5). 
Note that w(o,o r ) G (SU^ 1 U {l})™ 54 +™ 2 and, of course, n 54 + n 2 = 0(n 54 ). By LemmaSIJ © and ©, 
to(a, a r ) is non-trivial because it contains a 7-cycle. Replacing a by w(a, a r ) and using the same procedure as 
above, Lemma [4.21 part ([3]), implies that in at most nine iterations we obtain a permutation a' with support 
size less than 0.326n. Each iteration increases the word length by a factor 0(n 2 ), as we may have to raise 
the input permutation to a suitable power to eliminate 2 and 3 from the cycle lengths, while conjugating by 
(a new) r and substituting into the word w contributes only constant multipliers to the word length. Hence 
a 1 is a word in S of length 0(n 70 ) and, by Theorem 11.21 diam(r(G, S)) = 0(n 78 ). 

Suppose next that = 1, that is, a has order 2 ei 3 62 . If e x > 0, let k = 2 ei " 1 3 e2 , otherwise let k = 3 e2_1 . 
Then k < n 2 , a k has order 2 or 3, and a k G (S U S' 1 U {l})" 2 . 

In Tables [6] and [71 we present two families of a/3-trees admitting w = w® , represented as before. The oc- 
trees in Table [6] (resp. Table [7J are pairwise non-isomorphic, contain at most 10 vertices, have all a-cycles 
and /3-cycles of length 2 (resp. 3), and have all a-paths and /3-paths of length at most 1 (resp. 2). 

In each cell of Tables [6] and O we have written Aut = k to mean that the automorphism group of the 
corresponding a/3-tree T has size k. We define T to be the set of a/3-trees in Table [6] (resp. Table [7J when 
a has order 2 (resp. has order 3). For these families, 



E 



where 



h{5) 



,_ T Aut(V)| 



(1 - (5) 2 (1 + 25 + 35 2 + 4<5 3 + 5<5 4 + 6<5 5 ) if a has order 2; 



(1 - 5) 2 + 5(1 - 5) 2 + 5(1 - 5) 3 + 25 3 (1 - 5) 2 
+A5 2 (1 - J) 4 + 3£ 3 (1 - 5f + 125 7 (1 - 5) A if a has order 3. 
+Q5 & (l-5f + 5\l-5f 

Define A = {1, . . . ,7}. If a has order 2 label the elements of so that a\\ = (1, 2)(3, 4)(5, 6)(7); then 
define g = (1, 5, 7, 2, 3)(4)(6). If a has order 3 label the elements of Q so that a\\ = (1, 2, 3) (4, 5, 6) (7); then 
define g = (1,7,2, 4)(3)(5)(6). 

By Lemma [2. H we may suppose that \S\ < n. We define N to equal 1 (resp. 2) when a has order 2 (resp. 
has order 3). We apply Theorem EJ with 5 = 0.3, w = w§°, T, N, A = 7, g, k = 10, and e = 0.001. We 
obtain r G (S U S' 1 U {l})™ 36 such that | fix(w(o, a r ))| > 0.999/i(<5). 

Using Sage [S+09] it is easy to check that the function <5 • — >- 1 — 0.999/i(e)) is monotone increasing on the 
interval (0,1). Furthermore, for 5 < 0.63, we have 0.999h(5) > 0.374 and so \supp(w(a,a r )\ < 0.626. Note 
that w(a,a r ) G (5U S~ 1 )°( n ' and, by Lemma [4TH ([3]) and ([!]), w(a,a T ) contains a 7-cycle. 

We now run the first part of the argument using the element w(a, a r ) instead of a as our initial element 
of small support. After one iteration we obtain an element a' G (SU S' 1 U {i})™ 2 0(n 36 )+n 54 with support of 
size less than 1 — 0.999/(5). Iterating as before, we obtain that diam(r(G, S)) = 0(n 78 ). □ 



5. Improving Theorem 11.31 

It should be clear to the reader that Theorem 11.31 is not optimal. In particular we prove Theorem 11.3 
with respect to a particular word, w = Wq° where wo = [a, /3 _1 ][a, /3]; it is this word that yields the value 
C = 0.63. How might one go about improving this value? 
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The most obvious way of improving Theorem 11.31 is via an appeal to higher powers. Consider Wk = Wq 
where k is any multiple of 60. All of the trees in Tables [H El [3l 0] and [5] admit Wf- and, for suitable choices 
of k, there will be yet more trees to consider. This will inevitably result in an increase for the value of C. 

There is a limit to the improvement that such a strategy might yield and we briefly explain why this is 
the case. Fix a word uo, let k be some positive integer, and define the word Vk = Vq. Let T be a set of 
a/3-trees admitting Vk and consider the sum 

S T - fixed (T) 



(5.0.1) g k (5) = ]T 



7 . er ,Aut(T)| • 

The main result of Section [31 Theorem 13.31 gives a lower bound for | hx(w(a,a r ))\ in terms of <?fc(<5), e and 
n; here a is an element of support equal to 5n and r is some short word. In particular, if the value of <7&(£) 
exceeds the value of 1 — 5 (by a sufficient margin in terms of e) then we obtain an element of smaller support 
than a. 

The advantage of considering higher powers of the word vq is exhibited in Theorem 13.31 by noting that if, 
say, k doubles, then new a/3-trees may be added to T, thereby increasing the value of for S £ (0, 1). 

Using methods different to those in this paper, the authors have developed a method to show that, in 
the "generic" case (see Section [3] for an explanation of what we mean by this), there is a number 5o £ (0, 1) 
such that 

lim sup gfc (5 ) < 1 — 5 

k— >oo 

whenever 5 > 5q. It is important to note that the number 5o depends only on the word vq. Furthermore the 
number 5q corresponds to the unique all-positive solution to a certain system of polynomial equations with 
rational coefficients and this system depends only on the word vq. 

In the case vo = wo = [a, /3 _1 ][a, /3], our method shows that 5q ~ 0.64242. One can see, then, that the 
value for C given in Theorem 11.31 is close to optimal when it comes to powers of the word wo. 

In another direction one might hope to improve Theorem 11.31 using an entirely different choice of word 
wq. With this in mind we undertook an exhaustive search of words of length at most 20 which were balanced 
(i.e. a,a _1 ,/3 and all occur the same number of times) and in which a^ 1 and f3 ±l occur alternately. 
For each such word w±, we performed the following computer experiment. For a variety of values n in the 
range 10 4 < n < 10 5 and 6 in the range 0.55 < 5 < 0.70, we took a random permutation a < S n with 
|supp(a)| = 5n, we constructed b as a random conjugate of a, and counted how many points occur in cycles 
of length at most 6 in the permutation W\(a, b). The highest counts occured for wo = [a, /3 _1 ][a, 0\ and for 
related words (like the cyclic permutations of wq or Wq) and this is the reason for our use of the word wq in 
the preceding proof. 

We also carried out a non-exhaustive investigation into words that were either non-balanced or non- 
alternating. In every case, for a word W\ of this kind, and for a and b as above, the computer tests suggested 
that permutations of the form wi(a,b) tended to have a smaller number of points in short cycles than 
w (a,b). 
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Table 2. a/3-trees admitting w 2 but not w 
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I 




a 5 (i-<?) 7 



9 * 



V • 

J 6 (l — <5) 6 





a 6 (l-£) 6 



5 5 (l-<5) 7 



5 5 (l-5) 7 



s 5 (i-sy 



T 

l 




^ 7 (i-(5) 6 




(5 7 (1-J) 6 



Table 5: a/3-trees admitting tw but not w 



1 




V I 

S 5 (l -S) r 




S 5 (l-S) 7 





1 1 



<5 5 (l-<5) 6 




; : i 




T 



t/i 



<5 4 (i -sy j 



y i 

5 4 (l-<5) 9 
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<P(l-5) 7 




<5 n (l-5) 7 



Table 6: a/3-trees for an involution 

















It 






S°(l-S) 2 
Aut = 1 


i 

S^l-S) 2 
Aut = 2 


11 

S^l-S) 2 
Aut = 2 


i 

<5 2 (l-<5) 2 
Aut = 1 


i 

«5 3 (l-5) 2 
Aut = 2 






$ 






- S) 2 
Aut = 2 


i 

S 4 (l-S) 2 
Aut = 1 


ir 

<5 5 (1-<S) 2 
Aut = 2 


5 5 (1 - <5) 2 
Aut = 2 





Table 7: a/3-graphs for an element of order 3 



Aut = 1 



S 3 (l-5) 2 
Aut = 1 



6°(1-S) 2 
Aut = 1 



A 



«5 3 (1- 
Aut 



5) 2 
= 1 



Aut = 1 



<5 2 (1 - <S) 4 
Aut = 1 
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A 



<5 7 (l-<5) 4 
Aut = 1 




5 6 (1-S) 4 
Aut = 1 




6 6 (l~S) 4 
Aut = 1 




S 4 (l-S) 6 
Aut = 3 




T 

J 




S 7 (l -S) 4 
Aut = 1 



S 7 (l-8) 4 
Aut = 1 
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